AITopics | policy distance

JSON-Bag: A generic game trajectory representation

Nguyen, Dien, Perez-Liebana, Diego, Lucas, Simon

arXiv.org Artificial Intelligence, Aug-4-2025

We introduce the JSON Bag-of-Tokens model (JSON-Bag) as a method to generically represent game trajectories by tokenizing their JSON descriptions, and we apply the Jensen-Shannon distance (JSD) as a distance metric for them. Using a prototype-based nearest-neighbor search (P-NNS), we evaluate the validity of JSON-Bag with JSD on six tabletop games (7 Wonders, Dominion, Sea Salt and Paper, Can't Stop, Connect4, Dots and Boxes), each over three game trajectory classification tasks: classifying the playing agents, game parameters, or game seeds that were used to generate the trajectories. Our approach outperforms a baseline using hand-crafted features in the majority of tasks. Evaluating on N-shot classification suggests that using a JSON-Bag prototype to represent game trajectory classes is also sample efficient. Additionally, we demonstrate JSON-Bag's ability for automatic feature extraction by treating tokens as individual features to be used in a Random Forest to solve the tasks above, which significantly improves accuracy on underperforming tasks. Finally, we show that, across all six games, the JSD between JSON-Bag prototypes of agent classes highly correlates with the distances between agents' policies.

Defining features and representations for games and their corresponding distance/similarity metric is foundational for any task that requires game analysis. Designing agents to play a game in a certain way (either to optimize playing strength [1], model human players [2], or optimize playstyle diversity [3]) often requires hand-crafted features using domain knowledge. Automated game design and content generation requires defining game metrics to evaluate generated solutions [4]. In these tasks, instead of only optimizing for the targeted fitness functions, optimizing also for diversity and novelty in the solution population can produce better results [5] [3]. Diversity in the population is usually enforced by either defining behavior criteria that partition the search space [6] or using a distance metric to evaluate the novelty of new solutions [5].
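
The pipeline described in the abstract (tokenize JSON, build bags of tokens, compare them with the Jensen-Shannon distance, classify by nearest prototype) can be illustrated with a minimal Python sketch. The tokenization scheme (path=value strings), the way prototypes are aggregated (summing token counts per class), and the toy Connect4-like trajectories are all assumptions made for illustration; the paper's actual choices may differ.

```python
import json
import math
from collections import Counter


def tokenize(node, path=""):
    """Flatten a parsed JSON value into 'path=value' tokens (assumed scheme)."""
    if isinstance(node, dict):
        for key, value in node.items():
            yield from tokenize(value, f"{path}/{key}")
    elif isinstance(node, list):
        for item in node:
            yield from tokenize(item, path)
    else:
        yield f"{path}={node}"


def bag(json_text):
    """Bag of tokens (token -> count) for one JSON-described trajectory."""
    return Counter(tokenize(json.loads(json_text)))


def normalize(counts):
    """Turn token counts into a probability distribution."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {token: count / total for token, count in counts.items()}


def js_distance(p, q):
    """Jensen-Shannon distance: square root of the base-2 JS divergence."""
    divergence = 0.0
    for token in set(p) | set(q):
        pi, qi = p.get(token, 0.0), q.get(token, 0.0)
        mi = 0.5 * (pi + qi)
        if pi > 0:
            divergence += 0.5 * pi * math.log2(pi / mi)
        if qi > 0:
            divergence += 0.5 * qi * math.log2(qi / mi)
    return math.sqrt(max(divergence, 0.0))


def prototype(json_texts):
    """Class prototype: normalized sum of the bags of its trajectories (assumed)."""
    total = Counter()
    for text in json_texts:
        total.update(bag(text))
    return normalize(total)


def classify(json_text, prototypes):
    """Return the label whose prototype is nearest to the query in JS distance."""
    query = normalize(bag(json_text))
    return min(prototypes, key=lambda label: js_distance(query, prototypes[label]))


# Hypothetical toy trajectories; real game logs would be far larger.
train = {
    "random": ['{"moves": [{"col": 0}, {"col": 6}, {"col": 3}]}'],
    "center": ['{"moves": [{"col": 3}, {"col": 3}, {"col": 4}]}'],
}
prototypes = {label: prototype(texts) for label, texts in train.items()}
print(classify('{"moves": [{"col": 3}, {"col": 4}, {"col": 3}]}', prototypes))  # center
```

Because a bag of tokens ignores order, the query above has exactly the same token distribution as the "center" prototype, so its distance to that prototype is zero. With base-2 logarithms the distance is bounded by 1, which is reached when two bags share no tokens.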

evolutionary algorithm, machine learning, natural language

2508.00712

Genre: Research Report > New Finding (0.46)

Industry: Leisure & Entertainment > Games > Computer Games (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.88)

Measuring Policy Distance for Multi-Agent Reinforcement Learning

Hu, Tianyi, Pu, Zhiqiang, Ai, Xiaolin, Qiu, Tenghai, Yi, Jianqiang

arXiv.org Artificial Intelligence, Jan-28-2024

Diversity plays a crucial role in improving the performance of multi-agent reinforcement learning (MARL). Currently, many diversity-based methods have been developed to overcome the drawbacks of excessive parameter sharing in traditional MARL. However, there remains a lack of a general metric to quantify policy differences among agents. Such a metric would not only facilitate the evaluation of the diversity evolution in multi-agent systems, but also provide guidance for the design of diversity-based MARL algorithms. In this paper, we propose the multi-agent policy distance (MAPD), a general tool for measuring policy differences in MARL. By learning the conditional representations of agents' decisions, MAPD can compute the policy distance between any pair of agents. Furthermore, we extend MAPD to a customizable version, which can quantify differences among agent policies on specified aspects. Based on the online deployment of MAPD, we design a multi-agent dynamic parameter sharing (MADPS) algorithm as an example of MAPD's applications. Extensive experiments demonstrate that our method is effective in measuring differences in agent policies and specific behavioral tendencies. Moreover, in comparison to other methods of parameter sharing, MADPS exhibits superior performance.
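
To make the idea of a pairwise policy distance concrete, here is a small Python sketch. It is not MAPD: the abstract says MAPD learns conditional representations of agents' decisions, whereas this stand-in simply averages the total variation distance between two agents' action distributions over a shared set of observations. The policy functions, the observation list, and the choice of total variation are all assumptions for illustration.

```python
def total_variation(p, q):
    """Total variation distance between two discrete action distributions."""
    return 0.5 * sum(abs(pi - qi) for pi, qi in zip(p, q))


def policy_distance(policy_a, policy_b, observations):
    """Mean per-observation distance between two policies (stand-in metric)."""
    distances = [total_variation(policy_a(obs), policy_b(obs)) for obs in observations]
    return sum(distances) / len(distances)


def distance_matrix(policies, observations):
    """Symmetric matrix of distances between every pair of agents."""
    n = len(policies)
    matrix = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = policy_distance(policies[i], policies[j], observations)
            matrix[i][j] = matrix[j][i] = d
    return matrix


# Hypothetical policies over two actions; each maps an observation to probabilities.
def greedy(obs):
    return [1.0, 0.0]


def uniform(obs):
    return [0.5, 0.5]


def greedy_copy(obs):
    return [1.0, 0.0]


observations = [0, 1, 2]  # placeholder observations, ignored by the toy policies
matrix = distance_matrix([greedy, uniform, greedy_copy], observations)
print(matrix)
```

In this toy setup the two greedy agents are at distance 0 from each other and at distance 0.5 from the uniform agent. A matrix of this shape is the kind of signal a parameter-sharing scheme could consult, for example by grouping agents whose mutual distance is small.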

action distribution, agent, policy distance

2401.11257

Country:

Asia > China > Beijing > Beijing (0.05)
Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
South America > Brazil > São Paulo (0.04)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Games (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.48)